Skip to content

Conversation

@pemeliya
Copy link
Contributor

@pemeliya pemeliya commented Nov 28, 2025

Added CommandBuffer support for Convolution ops
Graph capture of convolutions is by default disabled (since not all convolutions can be captured).
It can be enabled through a newly added flag: '--xla_gpu_enable_command_buffers=+convolution"

🎯 Justification
This op wase missing for whatever reason: this results in graph fragmentation especially for large models. Hence one gets several (sometimes many) execution graphs instead of just one.

🚀 Kind of Contribution
✨ New Feature

🧪 Unit Tests:
Added new subtest to xla/service/gpu/transforms/command_buffer_scheduling_test.cc and xla/backends/gpu/runtime/command_buffer_conversion_pass_test.cc

This is a fork of the original PR which was reverted due to conv problems: #32053
Now convolutions capture is by default disabled.

@beckerhe , @ezhulenev, @dimitar-asenov please have another look !

added UTs  and convolution command

test fixes

added rebase fixes

capture only those convolution targets which are explictly

Revert "adding coll permute and convolution to command buffers"

This reverts commit 75847e6.

added conv to command buffers

fixing build and test

fixing build

rewritten ConvolutionCmd, adapted command_buffer_conv_pass

some cosmetics
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant